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(L) In a memoir on SkeAv Variation jjublished in the 'Phil. Trans./ A, vol. 186, 
1895j a series of frequency curves are discussed which are integrals of the difierential 
equation 

1 1. ). 



y dx c^ + i'3'V + 0^' 
(See p. 381 of the memoir.) 

The discussion of four main types is given in detail, and a brief reference is made 
to various sub-types which may occur. The types considered in that memoir covered 
at the time all the frequency series, and they were fairly numerous, that I had had 
occasion to deal with. In the course of the last few years, however, I have been 
somewhat puzzled by frequency distributions for which the criterion 2^^ -— 3^| -- 6 
(see p. 378) was positive, and therefore cl priori a curve of the type 

i + fj 

was to be expected, but which on calculation gave p imaginary. The frequency 
distributions in question arosef occasionally in sociological statistics, but also in 

^^ 'Phil. Trans.,' A, vol. 186, p. 343. 

t Some other frequency distributions, which on first investigation fell under Types V. and VI. of the 
present paper, were found with improved values for the moments to fall under types already discussed. 
Mr. W. F. Sheppard's values for the moments (' Lond. Math. Soc.,' vol. 29, p. 369, formula 30) should 
certainly be used in preference to those given by me ('Phil. Trans.,' A, vol. 186, p. 350) whenever we are 
calculating the moments of a curve from areas and not from true ordinates. I hope shortly to publish a 
paper on this point, which is one really of quadrature formulae. Meanwhile for every true frequency 
curve with high contad at both termuah we ought to use 

IH -- ('^ (^'/ - Vi~ - o) 

!H - c^{n - ^nv^ + 6n'^^^/ -" 3Fi'^ - 1(1// - Fi'^) + .-h), 

instead of the values given on p. 350, /xg remaining unchanged. 

(297) 3 L 2 29.11.1901 
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biological investigations. It seemed, therefore, desirable to enter a little more 
fully into the analysis of the cases in which the criterion was positive but v 
imaginary, and discover what types of frequency curves had escaped my attention/^ 

The key to the solution lies in the fact noted on p. 369 of the memoir, namely, that 
even if the criterion be positive, there Avill still be a solution akin to Type I. and not 
to Type IV. if e be negative. No frequency series satisfying these conditions had at 
that time come under my notice, and later, when collecting data of floral variability, 
my own remark as to € had slipped from my memory. It is the object of this 
supplement to obtain an improved criterion of type, to discuss the nature of the 
curves which fill the gap observed, and. to illustrate by one or two examples the 
fitting of such curves to actual statistics. 

(2.) The Tivo Criteria. 

Throughout this supplement the notation of the previous memoir will be assumed 
to be familiar to the reader. 

Turning to p. 378 of that memoir, we note that since /3y and r — 1 are necessarily 
positive, z if positive must be > r^ Hence v can only become imaginary if z be 

negative, or 

(3, (r - 2y 



16 (r - 1) 



> 1 



Substitute in this the value of r and it becomes 






Hence the complete condition that a curve of Type IV, shall give the distribution 
of frequency is not only 

but also 

^ 4 (4^3 ^ 3^j) (2^3 ^ 3/9i ^ 6) ^ ' 

Turning back to p, 369, we see that € being positive the complete conditions for a 
curve of Type I. giving the distribution of frequency are 

/c, ^ 2/3g -^ 3/3^ - 6 < 0, 

^ I was very loatli to adopt Professor Eugeworth's method of inventing new frequency curves by 
putting X = fix') in a normal frequency distribution, y = y^e-^'^-^^^ . Besides strong theoretical objections to 
this process, I had found Equation (i.) so sufficient for a great variety of cases that I felt confident it must 
cover the newly discovered outstanding cases, and this confidence seems justified by the result. 
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and 



'<-2 = 



/8i w, + 3y 



4 (4/3, - 3ySi) (2^, - 3^ - 6) 



< 0. 



The latter condition will be always satisfied since yS^ and 4^83 — 3^^ are positive for 
any distribution whatever, and 2^^ — 3/3y — 6 is negative by hypothesis. 
Further, in the previous case k^ is seen to be essentially positive. 
Hence the criteria written down cover all possible cases but those for which 

Sub-cases which arise from transition curves just at the Hmits will, however, be 
likely to be of interest, What happens when k^ = oo and when ^2 = 1? The only 
possibility for ^2=00 is 2/3^ — 3/3j — 6, or ac^ = 0. But this curve has been fully 
treated under Type III. in the memoir. 

We shall see later that k^ = 1 leads us up to a novel transition curve of consider- 
able interest. 

To ascertain something about the general case in which Kg > 1 , let us return to 
the memoir again and examine the value of € on p. 369. It can only be negative if 



4 + i/5i(r+2)V(r+l) be < 0, 



Avhere r is here 






Substituting, we find at once 



f<2> 1, 



which in itself involves k^ > 0. 

Hence the missing gap corresponds to those cases in which e is negative. 

It will be clear that Kg, although in form giving a more complex criterion than k^, 
is really more effective, as covering all the j)ossible cases. We have then the 
following scheme : — 



Criterion K2. 


Corresponding frequency curve. 


/<:2 = CO 

K2 > 1 & < 00 . . . . 

K2== I 

K2 > & < 1 . . . . . 

K2 ~ 0, ^1 - 0, (3^-3, . 
K2 = 0, /^i - 0, ^2 not - 3 

'<2<^ 


Transition curve, Type III. (Memoir, p. 373). 

Type yi. (see p. 448 below). 

Transition curve, Type V. (see p. 446 below). 

Type lY. (Memoir, p. 376). 

Normal curve. 

Type II. (Memoir, p. 372). 

Type I. (Memoir, p 367). 
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The object of this supplement is to discuss the calculation of curves of Type V,, 
and to consider those of Type VI. somewhat more at length, they beuig only briefly 
referred to on p. 369 of the memoir. It will be seen that Type I, of the memoir has 
now broken up into two divisions. One portion is the old Type I. passing into the 
normal curve on one side and Type III. on the other. This Type III. separates the 
second portion, Type VI., of the old Type I. from the first portion. Type VI. passes 
from Type III. to the new transition curve Type V., which, like Type III., will be 
found to have a range limited in one direction only. Finally this new Type V. is the 
transition to the old Type IV. bounded on the other side by the sub-curve, the old 
Type II., and beyond that the normal curve. Thus we see that Types I. and IV. do 
not pass directly into each other through Type III., as might be supposed by the 
criterion k^ > or < 0, but that there are a series of intervening curves, two of which, 
Types V. and VI., recpire further consideration, if we are to complete the whole 
round of frequency distributions embraced under the differential equation (i.). 



(3.) On the Frequency Curve of Type F, 

Returning to the fundamental differential equation (i.), let us consider what 
transformation takes place when the denominator on the right has equal roots. '^ We 
may then write it in the form 

1 dy —03 0^1 1 



y dx Cq (c^ -f x)^ 6'o (ci + xf c,, (c, + .%') * 

el 1 

Hence log y =^ — - : — ,-— r — ^ log (c^ + ir) + const. 



Thus y ^ 3/q e '' '^"''' (c^ + x) ^, 

where, ^q is a constant, y = c^^/cy and p =■ 1/cq. Thus changing the origin we may 
write the curve : 

«.«« ' -. / » B O \ 

w^here x^q = y/p gives the distance of the mode from the new origin. 

To find the moments about this origin, we notice that, p and y being positive, 
y rrn: wheu X = and when x =^ oo , Thus as in the curve of Type III. we have a 
range *limited at one end only. 

To find the moments we have, if a be the area, 

aiJL^„=^\ y^x^P'^" er'^^^uix ..,.-..... (iv.). 

Jo 

* I owe to Miss Agnes Kelly, Ph.D., the suggestion that this type of frequency curve deserved fuller 
treatment. 
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Put 



X ri Ho . 



^1 



ylx =: z, and we find 



•00 





y^Yi-p-^^Y{2^—n'—l) . 



a 



Voy^ ^r(p- I) 



r/ (p ■ 
yVip 
fl ip 
yVip 



■ 2){p~- 3) 
■2){p .-3)(p 

■ 2) {p - 3) (p 



4) 

4)(p 



• « « « • 



• « « « * » • 



5)/ 



» » 



(v.). 

(vi ). 



(viL). 



Transferring: to the centroid we find 



M2 



/^3 



/X., 





7^ 


(j» - 


-2)^(p-3) 




47» 


(jo- 


-2)3(p-3)(i.-4) 




3Qj + 4) 7* 



\ 



(^-2)*0^-3)(^-4)(i)-5)/ 
o „ 2/„ 3 _ 16 (y — 3 ) 

Pi — /^3 /iU'S "" (|3 - 4)2^^ • 



P2 — ^4/^ 



3 



3 0, + 4)(j 7-3) 
O - 5) {p - 4) 



• • « 



Eliminating }) between /S^ and /Sg we find after some reductions : 

A (A + 3)2 = 4 (2^, - 3,81 - 6) (4/3^ - 3^1) 

^r, /Co ^^^^^ i 



(viii.). 



fix ^ 



• • i^-)- 



• • » « 1 Jxl» Jt 



Clearly, since this is the condition for Type V., that transition curve is none other 
than the curve obtained by making the denominator of the right-hand side of the 
difierential equation have equal roots. The curve is clearly of considerable interest, 
and its existence had not been noticed in the previous series of frequency curves. 

The manner of fitting it is now easily described. 

Equation (ix.) gives us a quadratic to find p — 4 : 






• •••♦,, \ -A.ii, f. 



The positive root of this is the required solution. 
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y is then found from the first of equations (viii.), or if cr be the standard deviation 

= a//x2, then f'' 

y ■=z (J (p — 2) . \/(p — 3) ....,.., (xlii.). 
Then (vi.) gives : 

_ ^_^^':'^^ / • X 

//') TY?? W " * ' ' ' ' ' ' ' * i^xiv.j, 

which determines ^^/q, the remaining constant for the shape of the curve. 

For the position of the curve, we have for the distance from origin to mean, from 
the first of equations (vil.) : 

/x/ ==: y/{2^ — • 2) = o^x^ip — ' 3) ...... (xv.). 

If d be the distance from mode to mean we have : 

d = /x/ - y/23 

7 7 2j 



9, Qi 01 ( qi „^ 9, 



Further, the skewness : 



]j-Z f p{p 






Sk. = c^/cr = ^-^^-^- --^ . (xvii.). 



Thus the solution Is completed. 

(4.) On the Frequency Curve of Type VL 

Type VL, as we have seen, corresponds to the case In which Type I. of the memoir 
has its e negative. Hence either m/ or m/ is negative and the curve transferring the 

origin takes the form 

y ^ yo (x — ay^'/x'''" ........ (xvIII.). 

Now It Is possible that this curve falls under the limited range type of a frequency 
from ^ = to ^ = a, but as we see that the criterion places Type VL between two 
curves of range limited In one direction only, we expect Type VL also to be of that 
character, and a complete solution Is obtained by taking the range from a^ = c^ to 
cc = 00 ; this Indeed fills up the gap for k^ > 1 and < oo , and (xvlli.) with this range 
Is seen to pass Into one or other of the two transition curves 

y-y^xPe-y\ 
or y z=zy^x-per^-\ 

according as we allow the first or second factor to approach a limit. 



■^ The sign of ih will determine the sign of y, or, what may be taken as the same thing, the direction 
of the axis of x. 

t Write : y = const, x r^"*"^ (1 - a:/ct)"^i and make ri%\ ^ oo^ and a ~ co but riiija finite. 

Or, y =-- const, x (1 - ajxy'hlx^'h-m,^ ^nd make a -= 0, mi --= a:, and a x m^ together with riii - nii 
finite. 
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Accordingly we shall write Type VI. in the form 

2/ = 2/0 (^ "-" ctYi^"^' .... 

and take the range from a to 00 . 
Differentiating to find the position of the mode we have 



« • • 



(xix.), 



tiymn — — 



«ft 



'mo 



?l-?3 



• • 



....... I ^\..A « ft 



For the moments about the origin : 



Put ct/x =: z, hence 



^l^n = Vo z^^ dx. 






;^^ri^^ 






Hence we deduce 



a 



^0 r(^i-^2-i)r(^2 + 1) 



a^i-93-i 



r(?i) 



• . » . • 



« I .2v.2v.l. Ii 



/^i 



i^i = 



^ (gi - 1) 

ft - fe - 2 



«'(ft-l)(!?i-2) 



/^3 



/^4 = 



(?i -23-2) (^1 - (?3 - 3; 

^' (gi - 1) (gi - 2) (gi - 3) 
(gi - gs - 2) (gi - ga - 3) {q^ - «?3 -- 4) 

^Hgi - IKgi - 2) (gi - 3) fa - 4) 



. 



(g] - g3 - 2) (gi - gs - 3) (?i - $-2 - 4) (ft - ?3 - 5) J 



(xxii,). 



Now if we compare these results with those on p. 368 of the earher memoir we see 
that the one set can be at once deduced from the other by writing m^ = — q^, m^ = Og, 
Thus with this interchange the whole of that solution holds, if we bear in mind that 
the range is now from x = a to <x . 

We easily find : 






= — g-i 4- g-o + 2 



e= 1 - q^^. q, - q^q^^ 



VOL. CXCVII. — A. 



3 M 
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and 1 — qi and g'a + 1 ^i'^ the roots of 

M / (V ""Y^ t "■■ '"""■"' Wees i)s«a«a I ^j?LxH« Fa 

where r and € are to be determined as in that memoir^ pp. 368-369. 



We have : 



/^3 — ^,2 (r + 1) ' ' \xxiv.;, 



where 1 -— q^ and r are both negative. This gives ar^ Thns q^^iqo and a are known, 
and from Equation (xxi.) 

we find the remaining unknown constant for the shaj)e of the curve, t/q. As before, 
various approximations may be used to the values of the r functions when either q^ 
or q^ or both are large, f 

We easily obtain for the distance between mode and mean 



d ^ ^(2l±&) /-xx^i \ 



and for the skewness : 



QU _ _(?i + ga) v/(gi :zJ'. - 3) , . . V 

fe-~g.V{(gi--l)fe+l)} [x^vn.). 



(5.) A special case of some interest arises when the start of the curve is a priori 
known. Suppose its distance from the mean to be c and let (using moments about 

centroid) 

ixjc^ — y^, I^b/{^I^zc) — y.^ . , . . \ . . (xxviii.). 

Then we easily find : 

1-^ gi _ ft. + 



(1 + &) (- ft + ^h + ^'0 ^ '^ ("1 + (/3)(ft - ^3 - 4) 



'^' 1 - cji being negative, e is negative, and accordingly by what goes before k2 lies between 1 and oo . 
t The value of yo for curves of Type I., if m,i be^small but ^2 large ('Phil. Trans.,' A, vol. 186, p. 369, 
foot-note), is 

and this can be easily modified to suit (xxv.) above. A very convenient and exact formula for T (n + 1), 
jf n be large, is that given by Forsyth ('B.A. Report,' 1883, p, 47) : 

I 

the error being less than -^rrjr^ of the whole. 
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W hence we deduce to determine qi and %'■ 

n — 1 - 3Y2 + 473 n JL-a — ^3 (1 + TaXVa - 1 - 273) , ^- N 

^^ ^^ 72 - 73 ^^ ^^ (273 - 73 + 7373)(73 - 73) ^ ^ 

and the solution proceeds as before. 

(6.) Illustrations. — I propose to note a few distributions of frequency in which I 
have come across Types V. and VL 

(A.) Statistics of Age of Bride at Marriage, the Biidegroora' s Age being betiveen 

24 and 25 years,^^ 

The observations given in the table, p. 454, are taken from Perozzo's memoir : 
''Nuove Applicazioni del Calcolo delle Probabilita . . . ," ' Eeale Accademia dei 
Lincei/ Anno CCLXXIX., 1881-2, Tavola L 

The total number of recorded marriages is 28,454. The moments were calculated 
by using Shbppard's corrections (' London Math. Soc. Proc.,' vol 29, p. 369), and are 
as follows :— 

Mean age of bride = 22*1 87f. 

' fi^^ 13*3346 
/X3=^ 67-8145 
li.^= 1224-6342 

Whence : ^Sj = 1*9396 

^3= 6*8873 
K^ = 1*9558 

K, = 1-1094 

^/ 

Thus by p. 445 we see that Type VI. is the frequency curve to be selected, but as 
K3 does not differ widely from unity, we shall probably get a good fit from Type V. 
as well. 

Taking Type VI. first, we find : 

r=- 12*11075, €=- 317-84987. 

The quadratic (xxiii. ) is accordingly : 

z^ + 12*11075 z - 317*84987 = 0. 

* I selected this example at random, as one out of several leading to the curve types it was my 
object to illustrate. There is so much tampering with statistics, however, whenever they refer to the ages 
of women, that it would probably have been better to have used the men. 

3 M 2 
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Thus: (71= 25-88401, q^=:z IV77S26. 

Hence by (xxiv.) a == 8*268,405, 

and by (xxv.) log y^ = 24*275, 3032, 

We have accordingly for the equation to the curve : 

y = 1021 X 1-884,965 ^^--~^~^£^- . 

The distance from the origin to the mean is given by the first equation of (xxii.) : 

^/ = 1 6*98913, 

or, the theoretical range starts with brides of 5^198,570 + 8'268,405 = 13'466,975 
years. This is an excellent underlimit to the age of women marrying men of 24 to 
25 in a country like Italy. Our first group is at IS'S, and the above start is just two 
base units before this initial group. 

The skewness = '498,953, and the distance from mode to mean =: 1 '822,004, or 
the mode is at 20*3657 years. 

Turning now to Type V. we have the following results :— 



16/^^:= 8*249,262. 
Hence Equation (xiL) is : 

(p _ 4)3 _ 8*249,262 {p ^ i) ^ 8'249,262 = 0. 

Thus the positive value of p is : 

p— 13*150,747. 

Equation (xiii.) gives : 

y = 129-7308L 
Then (xiv.) gives : 

log 2/o — 22*367,6952. 

Thus the equation to the curve is : 

y = 10-2^ X 2*331,821 a^-i3-i5o,747 ^^nmBOBy.^ 

To find the position of its start we have by (xv.) : 

^/= 11*6343, 

or, since the mean age of brides is 22*1877, the youngest possible theoretical bride is 
10*5534 years. This is probably a worse determination of the underlimit than in the 
case of Type VI, At the same time I notice that out of about 180,000 women, lOJ 
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were married between 14 and 15 years of age, and all the curves begin with a 
sensibly finite ordinate at 14*5 ; it is accordingly possible that a somewhat lower age 
than 13 '5 actually occurs in Italy. 

Equation (xvi.) gives us for the distance from mode to mean : 

d=zV769A, 

or the modal age at marriage is 20*4183 years. This is only about '053 of a year or 
about 19 days different from the modal age as given by Type VI., a most satisfactory 
aefreement. 

For the skewness we have from Equation (xvii.) : 

Sk. = -4845, 

or, it differs by less than 3 per cent, from the skewness as given by Type (VI.). 

The diagram (fig. i.) shows the two curves, and the table compares the results 
obtained from either with the observations.'^' 
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A^ of Bride, 



Type V. 



Type VI. 



It is clear that for all practical j)urposes the curve of Type V. is as good as that 
of Type VI. Indeed, there is pi'actically no difference between them except for the 

* The observation data are really areas, while to save lengthy calculations we have compared both in 
diagram and table the ordinates of the theoretical curves. This is in general legitimate, if, as in this 
case, the number of groups is very large. 
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ages 15 to 17. The fit is, however, not a veiy good one, and although it is 
indefinitely better than a normal cnrve, and we see why in the absence of these 
types the statistics could not be fitted with any of the first series of skew curves, 
yet we are compelled to consider that there are causes other than chance at work 
very definitely affecting the frequency of the recorded ages. Thus the bridegrooms 
being 24 to 25, the desire of the bride to be recorded as younger than her husband 
probably fully accounts for the bulk of the preponderance of observation over theory 

Table of Observed and Calculated Frequencies. 







Calculated frequency. 






Calculated frequency. 


Age. 


Observed 
frequency. 






Age, 


Observed 
frequency. 














" 




Type V. 


Type YL 

1 

49 


30=-31 


256 


Type V. 


Type VI. 

282 


15-16 


367 


70 


281 


16-17 


717 


514 


489 


31^32 


164 


201 


198 


17-18 


1294 


1538 


1560 


32^-33 


134 


148 


146 


18-19 


2121 


2751 


2800 ; 


33-^34 


94 


104 


105 


19-20 


3156 


3591 


3622 


34--35 


77 


75 


76 


20-21 


4009 


3830 


3831 


35-^36 


68 


55 


55 


21-22 


3593 


3577 


3560 


36-37 


59 


40 


40 


22-23 


3604 


3055 


3034 


37-38 


33 


29 


29 


23-24 


3060 


2456 


2439 


38-39 


40 


21 


• 22 


24-25 


1774 


1894 


1884 


39-^40 


27 


16 


16 


25-26 


1353 


1419 


1415 


40-=»41 


18 


12 


12 


26-27 


936 


1044 


1043 


41^42 


21 


9 


9 


27-28 


663 


758 


760 


42^43 


11 


7 


7 


28-29 


468 


546 


549 


43-44 


14 


5 


5 


29-30 


319 


392 


395 


44-45 

i 
1 


4 


4 


4 



in the frequency of the brides of 22 to 24. The defect of brides between 17 and 
20 may be again due to the tendency to state the age as over 21 ^ and so free 
the woman from the need for parental sanction. ^^ These causes, giving a false 
displacement of age frequency, are probably in themselves sufiicient to account for 
the theoretical defect in brides of 15 to 17. 



(7.) (B.) On the Variation in the Number of Lvps of the Medusa P» Pentata. 

My data are the following, taken from a paper by Alfred GoLBSBORogoH Mayer : 
" The Variations of a Newly Arisen Species of Medusa," ^ Science Bulletin of the 
Museum of the Brooklyn Institute,^ vol 1, p. 1, 190L 



^ I have found in England the statement of the bride's age in the marriage licence is for the same 
reason occasionally not in accordance with the year of birth as shown by the parish register. 
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Frequency. No. of lips 
2 1 

5 2 

18 3 

123 4 

798 5 

49 6 

1 7 



Total 996 

Mr. Mayer (p. 12) notes the failure of my curve of Type IV. I find for the 
constants : 

Mean = 4*8685 lips. 

7X.3 = -309,006, o- = -55588 

/X3 = - -350,697 

,ji^ =z 1-181,718 

^j r= 4-16834 

^83= 12-37598 

K^ = 6-24694 

Kg = 1-06594 

Since k^ is so nearly unity we may use Type V. 
Hence I find : 

p= 8-66184 y=:- 8-811634 

(y must be negative since /xg is negative) 

/x/= 1-32270. 

Thus the curve starts at 6-19118 lips, or the one medusa with seven lips is 
theoretically excluded. Here I have worked with the uncorrected moments 
because the lips are discontinuous variants. Working with Sheppabd's corrective 
terms the limit is about six lips, and with the corrective terms suggested in my 
memoir on skew variation the limit is 7 '6 5. Further we have : 

log 2/0 = 6-829,3633, 

distance from mean to mode = '30541 

Sk. = -54941. 

The mode is thus at 5-17389, in good agreement with observation. 
The equation to the curve is, taking x positive from 6-19118 lips towards lesser 
values : 

log y = 6-829,3633 - 8-66184 log«; ~ --^^^-''- . 
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This curve was drawn on a large scale and its areas read off with an 



integrator 



The following theoretical frequencies were obtained : 



No. of lips. 
6 and over 

5 
4 






1 



Observation. 

50 
798 

123 
1.8 



5 



2 



Calculation. 

47 
762 
160*5 

20 



5 



1%5 



There would not be any serious divergence here, were it not for the group with 
four lips, which observation shows to be much under-represented. But it must be 
remembered that we have only seven groups, and that such a number is very 
insufficient for a good determination of the moments of a curve. Further, the 
variation is not really continuous^ as indicated by the curve, but discrete. We have 
at present no clear statement as to how the moments of a discrete system of variation 
should be modified or corrected so as to give the best results for the moments of the 
continuous curve which is to theoretically represent the series. I am doubtful 
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whether Sheppard's corrections— the best for continuous variation— are equally 
appropriate in this case. Above I have used merely the rough moments, but I 
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have found by considerable experience that in the case of discrete variables, to treat 
the system as a polygon and correct, as in my memoir on Skew Variation (' Phil. 
Trans./ A, vol. 186, p. 350), appears to give the best results when the areas are 
compared with the discrete groups. The point wants further investigation ; when 
we have a large number of groups it is of little importance, but it makes a consider- 
able difference in these excessively skew distributions of discrete variables when the 
number of groups are small. "^^ 

Above all, the diagram (fig. ii.) shows how all important it is to compare areas and 
not merely the ordiiiates of the frecj[uency curve with the blocks representing the 
discrete frequencies in such a case as this. The wide-spread custom among foreign 
investigators of comparing merely the ordinates of the theoretical frequency curve 
with the observed frequencies leads in such cases to most fallacious results. 



(8.) (C.) On the Distribution of Incidence of Scarlet Fever Cases loith Age. 

It seems desirable to give an illustration of the method of dealing with a distri- 
bution which falls under the class dealt with in Section (5) of this paper. Dr. 
Macdonell, in dealing with the intensity of incidence of different diseases at various 
ages, has come across in scarlet fever a good illustration of curves of the types now 
under consideration. The whole of the arithmetical work on the present example is 
due to him, and I have to thank him very heartily for allowing me to use it here. 

The statistics are taken from the ' Report of the Metropolitan Asylums Board' 
(Statistical Part, 1899), They involve 39,253 male cases, distributed as follows : — 



Year of life. 


Frequency. 


Under 1 


443 


1 2 


1456 


2-3 


2631 


3-4 


3599 


4 5 


3862 


5-10 


15791 


10-15 


7359 


1 5-20 


2366 



Year of life. 


Frequenc}' 


20-25 


926 


25-30 


420 


30-35 


215 


35-40 


91 


40-45 


45 


45-50 


26 


50-55 


17 


55-60 


5 


60-65 


L 



The data being grouped partly in one and partly in five-year periods the moments 
had to be calculated with caution, separating the material into two pieces. Taking 
five years as the unit. Dr. Macdonell found for the uncorrected moments : 



^ E.g., petals of buttercup>s, teeth on the carapace of prawns, lips of medusse, as compared with veins on 
chestnut leaves, florets on ox-eyed daisy, &c. 
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Mean age of incidence, 8 '60975 years, 

11^ - 1-369,345 

fjjo —"• O /jOO ^ 1. J -i- 

/x^^ :=z: 19-143,575. 

The moments were not modified by She:pp,ABd's corrections, for these suppose contact 
of a high order at both terminals of the cur^e, and it was quite apparent that the 
curve must rise at a finite ano:le on the biith sidee Tiie JoUowino; additional constants 
were then determined : -— 



^1 



4 



K \J i I ^^ Zl jL) , 



Kj_ z=z 2*205,000, 



X,— ]U"S09,333, 

K, ::::= 2°8i3 783. 



Thus Ko is >1 and <cxd and the distribution is of Type VI. Now let us suppose the 
incidence of scarlet fever to start with birth, althouo'h there m.ie;ht, as in the case of 
enteric fever, be really some antenatal cases/''' 
Turning to Section (5) we ha^ 



V O 



c :=: distance from birth to mean =: 8^60975 years = 1 '72195 units. 



Hence we deduce 



y, zz: -461,819, 



y^:^ '685,596, 



And so from (xxix,) 






9 



10-532,485, 



f/jj -f g, =.- 15-417,281 ; 



or, 



Then from 



we find 



5] = 12-974,883; 



93 



A 4-4t<ii, o «y o 1 



C, 



H 



a ^ 4^2687104. 



2 



and, fixually, after determining y^ from (xxi. 

log?/o — 13'652,5078, 
Thus the values of the frequency are given by 

logy — 1.3-652,5078 + 2'442,398 log (;r-^4=268,104) - 12'974,883 log^'. 
The origin of the curve is thus 4*268,104 before birth. The mode is given by 



x,^^ ^ ciqi/{qi—'qo) = 5^257,842, 



Thus : X 



mo 



a 



•989,738 :::= 4'94869 yrs 



"^ See 'Phil Trans./ A, vol. 186, p. 390, The remarkably sharp rise of tlie scarlet-fever distribution 
as compared witli the enteric is, however, much against this. 
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This gives for ynwae the value 3892. 

Distance between mode and mean = 3*66106 yrs. 

Whence we find for skewness the value 

The diagram (fig. iii.) shows that the fit may he considered a good one. 



-^,000 




i^a of incideiwe. 



(9,) The conclusions of this paper are, I think, of some interest from the general 
standpoint of scientific investigation, A certain num^ber of frequency distributions 
had been found, not only by my co-workers and myself here, but by biologists in 
America, not to fit into the general system of skew distributions dealt with bv me in 
my original memoir. The first conclusion was that however wide-reaching that 
system appeared to be, it was a failure for a few remarkably skew distributions. But 
on more careful investigation of the differential equation it appeared tha,t two types 
of solution had been left out of consideration, and that these were precisely those 
needed in the recorded cases of faihrre. 

I owe some apology to authors like Professor Davenport and Dr. Dukckeh, who 
have recently issued text-books on the appHcation of statistical methods to biological 
variation, because although we have known and used these curves for some years past, 
no account has hitherto been published of them, and, consequently, biological 
investigators^^^ using their rhvmws of my methods have be.en, and I fear still may be, 
occasionally puzded. 

"^ E.g., Mr. A. G. Mayer in the paper on MeckistB referred to above 
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